PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Zpz_sc04592.1.g00020.1.sm.mkhc
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Chloridoideae; Zoysieae; Zoysiinae; Zoysia
Family HD-ZIP
Protein Properties Length: 853aa    MW: 92957.2 Da    PI: 7.0467
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Zpz_sc04592.1.g00020.1.sm.mkhcgenomeZGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox47.43.3e-152475451
                                    -SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHH CS
                        Homeobox  4 RttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNr 51
                                      ++t+eq+e+Le++++++++p+  +r++L + +    +++ +q+kvWFqNr
  Zpz_sc04592.1.g00020.1.sm.mkhc 24 YVRYTPEQVEALERVYHECPKPTSLRRQQLIRDCpilsNIEPKQIKVWFQNR 75
                                    6789***********************************************9 PP

2bZIP_121.16.8e-07841301763
                                     HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH CS
                          bZIP_1  17 ArrsRqRKkaeieeLeekvkeLeaeNkaLkkeleelkkevaklksev 63 
                                     A r+R++ ++e  +L++   +L+a Nk L +e+++l+k+v++l +++
  Zpz_sc04592.1.g00020.1.sm.mkhc  84 AGRCREKQRKESSRLQTVNRKLSAMNKLLMEENDRLQKQVSRLVDDN 130
                                     67**************************************9997765 PP

3START165.34.3e-521833882202
                                     HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEE CS
                           START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakae 80 
                                     +aee+++e+++ka+ ++ +Wv+++ +++g++++ + + s+++sg a+ra+g+v  ++a  v+e+l+d++ W +++++++
  Zpz_sc04592.1.g00020.1.sm.mkhc 183 IAEETLTEFMSKATGTAVNWVQMVGMKPGPDSTGITAVSHNCSGVAARACGLVSLEPA-KVAEILKDRASWYRDCRRVD 260
                                     6899******************************************************.8888888888********** PP

                                     EEEEECTT..EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEE CS
                           START  81 tlevissg..galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgil 153
                                     +l vi +g  g+++l++++++a+++l+  Rdf+++Ry+  l +g++vi+++S++  +  p    s+++ Rael+pSg+l
  Zpz_sc04592.1.g00020.1.sm.mkhc 261 ILHVIPTGngGTIELIYMQTYAPTTLAEpRDFWTIRYTSGLDDGSLVICERSLTKSTGGPCgpnSPNFTRAELFPSGYL 339
                                     ***********************999866****************************9887788*************** PP

                                     EEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXX CS
                           START 154 iepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqr 202
                                     i+p+++g+s + +v+hvdl++++++++lr+l++s  + ++k++va++++
  Zpz_sc04592.1.g00020.1.sm.mkhc 340 IRPCEGGGSMIYIVDHVDLNAWSVPEVLRPLYESPKILAQKMTVAAMRH 388
                                     **********************************************987 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5007112.391875IPR001356Homeobox domain
SMARTSM003891.0E-82096IPR001356Homeobox domain
SuperFamilySSF466891.28E-122276IPR009057Homeodomain-like
CDDcd000862.29E-132393No hitNo description
PfamPF000468.2E-132475IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.603.4E-152675IPR009057Homeodomain-like
CDDcd146862.36E-586124No hitNo description
PROSITE profilePS5084827.862173392IPR002913START domain
Gene3DG3DSA:3.30.530.204.8E-24181365IPR023393START-like domain
SMARTSM002344.8E-42182392IPR002913START domain
SuperFamilySSF559611.51E-35183392No hitNo description
PfamPF018527.1E-50183389IPR002913START domain
SuperFamilySSF559613.02E-5430516No hitNo description
SuperFamilySSF559613.02E-5544616No hitNo description
PfamPF086709.6E-35717852IPR013978MEKHLA
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 853 aa     Download sequence    Send to blast
MAAVASRERR LSPASAPHVD TGKYVRYTPE QVEALERVYH ECPKPTSLRR QQLIRDCPIL  60
SNIEPKQIKV WFQNRSTEHC LLDAGRCREK QRKESSRLQT VNRKLSAMNK LLMEENDRLQ  120
KQVSRLVDDN GCMRNQLHKA SAAITDTSCE SVVTSGQHHL QQNQPVLHPL QRDANNPAGL  180
LAIAEETLTE FMSKATGTAV NWVQMVGMKP GPDSTGITAV SHNCSGVAAR ACGLVSLEPA  240
KVAEILKDRA SWYRDCRRVD ILHVIPTGNG GTIELIYMQT YAPTTLAEPR DFWTIRYTSG  300
LDDGSLVICE RSLTKSTGGP CGPNSPNFTR AELFPSGYLI RPCEGGGSMI YIVDHVDLNA  360
WSVPEVLRPL YESPKILAQK MTVAAMRHIR QLALESSVEI LYSARLQPAV LRTICQRLSR  420
GFNDAVSGFS DDGWSLLSSE GSEDITISVK SCPSKLDGFC VSTSPFFSAI GGGIVCAKAS  480
MLIQNVPPAL LVRFLREHRS QWADPGVDAY SASSLRTSQY TIPGLRGGGF IGGQATVPLA  540
ETIDHEESLE IVKLEGNGFG HDDVLPRDML LLQLCSGVDE SAPGACAQLV FAPIDGSFTD  600
DAPLLPSGFR VLPLDGKADA PSATHTLDLA SSLEVGAGGA LHASKGTPSV GNVRSVLTIA  660
FQFSFENHLR ESVAAMARQY IRAVMASVQR VAMAISPSRI GLQMEMKHPP GSPEAQTLAR  720
WISRSYRVHT GTEIRWSDNK GTESPLELLW KHSDAASPML MFANSAGLDI LETTLINIQD  780
MPLETVLGDK GQKALFLELP NIMNKGFASL PRGVCKSSMG RQASYEQAVA WKVLGDDGAP  840
HCLALMLVNW TFI
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002443548.10.0hypothetical protein SORBIDRAFT_08g021350
SwissprotA2ZMN90.0HOX33_ORYSI; Homeobox-leucine zipper protein HOX33
SwissprotQ2QM960.0HOX33_ORYSJ; Homeobox-leucine zipper protein HOX33
TrEMBLC5YRY30.0C5YRY3_SORBI; Putative uncharacterized protein Sb08g021350
STRINGSb08g021350.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP37438197
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G34710.10.0HD-ZIP family protein